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Abstract 

We present a guidance by repulsion model based on a driver-evader interaction 
where the driver, assumed to be faster than the evader, follows the evader but cannot 
be arbitrarily close to it, and the evader tries to move away from the driver beyond a 
short distance. The key ingredient allowing the driver to guide the evader is that the 
driver is able to display a circumvention maneuver around the evader, in such a way 
that the trajectory of the evader is modified in the direction of the repulsion that the 
driver exerts on the evader. The evader can thus be driven towards any given target 
or along a sufficiently smooth path by controlling a single discrete parameter acting 
on driver’s behavior. The control parameter serves both to activate/deactivate the 
circumvention mode and to select the clockwise/counterclockwise direction of the 
circumvention maneuver. Assuming that the circumvention mode is more expensive 
than the pursuit mode, and that the activation of the circumvention mode has a 
high cost, we formulate an optimal control problem for the optimal strategy to drive 
the evader to a given target. By means of numerical shooting methods, we find the 
optimal open-loop control which reduces the number of activations of the circum¬ 
vention mode to one and which minimizes the time spent in the active mode. Our 
numerical simulations show that the system is highly sensitive to small variations 
of the control function, and that the cost function has a nonlinear regime which 
contributes to the complexity of the behavior of the system, so that a general open- 
loop control would not be of practical interest. We then propose a feedback control 
law that corrects from deviations while preventing from an excesive use of the cir¬ 
cumvention mode, finding numerically that the feedback law significantly reduces 
the cost obtained with the open-loop control. 


Keywords: Guidance by repulsion, Driver-evader agents, Optimal strategies, Feedback 
control law. Nonlinear dynamics. Numerical simulations 
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1 Introduction 


“F 0 // 0 W me” (FM) is probably the most natural strategy to solve the guidance problem, 
that is, to guide something or somebody along a given trajectory or towards a specihc 
target, whether physically along geographical paths (streets, roads, buildings), concep¬ 
tually (language learning, politics) or even spiritually (religion, social networks). The 
success of the FM strategy is based on the effect of the attraction that the guide (the 
leader, the driver) exerts on the guided (followers, driven). Also successful! is the “T>o as 
I do'' strategy, based on imitation (of behavior) or alignment (of opinion), and used in 
the above mentioned helds, among others (social learning). 

Less expected is the effectiveness of the opposite strategy, namely, “move away from 
me" (MA), based on repulsive interactions. That repulsion can serve to guide something 
or somebody is shown by nature itself, not only by means of gradient helds (electromag¬ 
netic, temperature or chemical helds), where attraction towards high densities can be 
viewed as repulsion from low densities, but also by specihcally repelling targets or agents. 
In neural development, axonogenesis takes place by combining attractive and repulsive 
guidance, so that the axon growth follows guidance cues presented by chemoattractant 
and chemorepellent molecules located in the environment of the cell mizi. In animal herd¬ 
ing, sheepdogs are used to guide sheep hocks through a repulsive force that dogs exert on 
sheep |3]. Guidance cues can also be magnetic, as in drug targeting aE], thermotactic, 
as in sperm guidance [6], cognitive, as in crowd motion and traffic how [HE] or in opinion 
formation [5], acoustic signals, as in animal alarm calls or instrumental conditioning [3], 
food trail pheromones in ants [TU] (chemical at a scale larger than the cell), etc. 

Attractive and alignment guidance problems are being studied for a long time by means 
of agent based models [IIl[l2l[l3l[Il[l5l[l6l[I71[l8l[l9ll2ni[2ll[22ll23l[Mll25l[26lE^ 
with attention to guidance by leadership [28l |29l |30], and optimal strategies to minimize 
time guidance (optimal evacuation times) or distance travelled by the agents have been 
found for several systems [32l [3ll [Ml [35l [36] • 

Very recently, in 2015, the optimal strategy for a hocking model to reach a target 
point or to follow a given trajectory through attractive and alignment guidance has been 
presented | 37 |. In this model, individuals from the hock interact through attractive- 
repulsive and alignment forces with the rest of individuals. Interactions are symmetric, 
except for one specihc individual -the leader- which exerts on each other individual an 
extra attractive force. The result is that the leader is followed by the hock, so that, by 
controlling the behavior of the leader, a FM strategy can be used to make the hock reach 
a given target or move along a given trajectory. 

Repulsive forces in attractive-repulsive models have mostly been considered for colli¬ 
sion avoidance with obstacles or with other agents. Guidance by repulsion has received 
much less attention, which is reduced, to our knowledge, to the above mentioned axono- 
genesis and animal herding. Also very recently, in 2015, an agent based model has been 
introduced to describe a so-called defender-intruder interaction, where repulsion is used 
by a defender to expel an intruder away as far as possible from a protected target [38] . 
The authors in [38] hnd an optimal MA strategy, which not always consists in approaching 
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the intruder as close as possible, but to simply drive the intruder away beyond a short 
distance of security. Repulsion in the intruder-defender interaction is qualitatively dif¬ 
ferent than in collision avoidance or in interception [321 SO], where the attractive and/or 
alignment forces determine (most of) the behavior. 

Defender-intruder problems fall into the category of “conflicting interactions” |38] . 
which are well described by the classical pursuit-evasion (PE) framework [HI |12]. The 
simplest scenario for a PE interaction consists of a single pursuer that follows and tries 
to capture a single evader that tries to escape to inhnite from the pursuer [38] . 

Although our interest does not focus on conflicting interactions, we adopt here the 
PE framework. The guidance by repulsion can indeed be described with the simple two- 
agents PE framework, provided two considerations are taken into account: hrst, the guide 
is not exactly a pursuer, as it often separates from the direction towards the evader and 
the guide cannot be arbitrarily close to the evader, and second, the agent to be guided is 
not exactly an evader, as it doesn’t try necessarily to escape to inhnite but simply moves 
away a short distance from the repelling guide. 

We present here a guidance by repulsion model based on the two-agents PE framework. 
We will refer to the guiding agent as the driver, which tries to drive the evader. The driver 
thus follows the evader but cannot be arbitrarily close to it. This is especially interesting 
if the driver cannot approach the evader or contact between agents should be avoided 
(because of chemical reactions, animal conhict, etc). The evader moves away from the 
driver but doesn’t try to escape beyond a not so large distance. The driver is of course 
faster than the evader. At a critical short distance, the driver can display a circumvention 
maneuver around the evader that forces the evader to change the direction of its motion. 
Thus, by adjusting the onset and offset of the circumvention maneuver, the evader can 
be driven towards a desired target or along a given trajectory. Our goal is to hnd optimal 
strategies to drive the evader in the most efficient way. 

We use an inertial model where interactions between agents take place through asym¬ 
metric newtonian forces. The asymmetry consists in that one agent is attracted and 
repulsed, while the other is simply repulsed. This kind of interaction has been coined 
“anti-newtonian” by Sprott [I9] and others [201 EH]- Thus, velocities are not constant 
(they depend on the state of the system), and no alignment forces are considered. 

We denote by D and E (and indexes d and e) the driver and the evader agents 
respectively. Both agents obeys the Newton’s second law, that is, Ui(t) = Vi(t) and 
miVi{t) = where Ui{t) = {uf (t), uf (t)) e M?, Vi{t) = (nf (t), nf (t)) =e M^, m* G M+ 

and Eiit) = {Ef it), {t)) G denote the position vector, the velocity vector and the 

mass of agent i, and the resultant force to which agent i is subject, respectively, for 
i = d,e. 

The force acting on the evader Ee{t) has only one component, which is in the direction 
of escape from the pursuer, Ud—Ue- The force acting on the pursuer Edit) has a component 
collinear to Ee{t) and a lateral perpendicular component {ud — Ug)^ which allows the 
pursuer to surround the evader, therefore forcing the evader to change the direction of 
escape. Here {x,y)^ = {—y,x). 
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The perpendicular component of the force acting on the pursuer can be switched 
on and off by means of the control parameter K{t) which takes values on { — 1, 0,1}. The 
control parameter K{t) is the key ingredient of the model, as it determines the behavior of 
the pursuer, which in turn determines the behavior of the evader. The resulting dynamical 
system can be considered as a driver-evader system with two operating modes controlled 
by a single parameter. 

Contents of the paper and sketch of the results Section titles are self-explanatory. 

In Sec. 2, we introduce the model equations and parameters, describing in detail the 
interactions between agents and with the environment. We show that the driver-evader 
system can be viewed as having two operating modes controlled by a single parameter, 
so that the system can be moved from one state to another in order to make the driver 
to guide the evader from any point to any other point. We study the controllability of 
the system, and prove that, although the system is not fully controllable, agents positions 
remain asymptotically close to each other (he., an agent can not go to inhnity). 

In Sec. 3, we consider what are the optimal strategies which allow the driver to guide 
the evader to a desired target. Activating a system has a cost, as well as keeping it in the 
active mode. As an illustrating example, the driver can be viewed as a spacecraft with 
two lateral propellers whose ignition process and fuel consumption are very high with 
respect to the consumption of the back propeller. Our interest is in reducing the cost by 
minimizing 1) the number of activation and 2) the time of use of the lateral propellers. 

To do that, we formulate an optimal control problem (OCP) for a cost functional 
including these two costs. Whe then hnd, in Sec. 3.1, the (unique) optimal strategy al¬ 
lowing the reach the target in an initially active system (^(to) = 1)) cind the (unique) 
optimal strategy which reduces the length of the time interval [toNjtoFp] during which the 
control parameter has a non-zero value. This second strategy also reduces the number 
of manipulations of the control to two (one switch on and one switch off). In this mini¬ 
mization process, the cost of back propellers (which are always switched on, the driver is 
a self-propelled agent which is always attracted by the evader) is neglected compared to 
the cost of using the lateral propellers, so that there is not (too much) concern by long 
trajectories or long execution times where lateral propellers are switched off. 

These open-loop controls are subject to the reproducibility of the initial and environ¬ 
mental conditions and to uncertainty about the model [321 [3T] . In fact, we show here in 
detail that the system is highly sensitive to small variations of the values of the control, 
thus suggesting the appropriateness of the use of closed-loop controls that can afford for 
the random perturbations arising in real systems. 

We then present in Sec. 3.2 a feedback control law allowing to drive the evader to any 
desired target with an arbitrary accuracy. This feedback law is especially advantageous 
as it yields a similar cost than the open-loop controls, in real conditions (he., under 
perturbations). Moreover, the feedback control law provides an excellent insight for the 
search of a control function yielding a substantially lower cost; as an example, we report 
numerical simulations of a case in which the resulting cost is reduced almost a 60% of the 
cost provided by the open-loop controls. 
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Finally, in Sec. 4, we present our conclusions and we comment on future directions, 
in the last section 


2 Model formulation and first analysis 


2.1 Equations and parameters 

The system consists of 8 ordinary differential equations (ODEs) and 8 initial conditions 
for the components uf(t), vf{t) and for i = d,e. Expanding the resultant 

force acting on the zth-agent J^i{ud{t),Ue{t),Vdit),Veit)), the system can be written as the 
following 4 vectorial ODEs with 4 initial conditions: 


Ud{t) = Vd{t), 
Ueit) = Veit), 


Vdit) = — 
rrid 


-c 


-a 


R 


E ^dit) ^e(^) 

\\Udit)-UeiW 


\\Udit) - Ueit)\\‘ 


Udit) - Ueit) - Kit)52 


jUd - Ue)^ 
\Ueit) - Udit)\\ 


Veit) = 


c 


E Ueit) Udit) 
me \\Ueit) - Udit)\\‘^ 


- VeVeit) 


Udih) = Ueito) = Vdito) = 0 and f4(fo) = 0. 


( 1 ) 

( 2 ) 


VdVdit) , 

( 3 ) 

( 4 ) 

( 5 ) 


Interactions between agents are as follows. 

The expression between brackets in Eq. ([3]) consists of two terms with respective 
coefficients O® and Cr. The hrst term corresponds to the long-range attraction and 
short-range repulsion force that the evader exerts on the driver. Here 5c is the distance 
at which the attraction balances the repulsion: if ||'Ue(^) ~ Udit)\\ > 5c, then the evader 
attracts the driver (and therefore the driver accelerates towards the evader), while if 
ll'Ue(^) ~ '*^d(^)ll < ^c, then the driver is repulsed by the evader (and the driver decelerates 
in the direction opposed to the evader). 

The term with coefficient Cr corresponds to the circumvention force, which as a 
component collinear to the attraction-repulsion interaction with the evader, given by 
Ud — Ue, and a component perpendicular to this direction, denoted by iud — Ue)^, where 
ix,y)^ = i—y,x). The circumvention force is thus a force that attracts the driver to¬ 
wards one of the two sides of the evader, where the side is determined by the sign of the 
parameter ^(t). 

The model considers two other critical distances, 5i, which is the (short) distance 
at which the intensity of the circumvention force is effective, and 52, which denotes the 
distance between agents left by the driver during the circumvention maneuver. In (|1]), the 
evader is simply subject to the repulsion from the driver, which has the same expression 
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than the force that the evader exerts on the driver bnt with a different (and smaller) 
coefficient C^. 

Finally, both agents are snbject to friction forces with the gronnd, which have the 
same form, in the opposite direction of the motion: —r'jh), for i = d,e. 

The model can be formnlated in different ways. The formnlation presented in ffTl)-l|5|) 
is based on the general expressions of the attraction-repnlsion forces introdnced by Gazi 
& Passino [22l |23], later widely used in realistic (biological) models [25l 1201 EH EH l27] . 
Agents are prevented from collisions by means of strong short-range repulsive forces (large 
exponents of \\ue{t) — Ud(t)\\ in the denominator), so that, although it is theoretically 
possible, in practice, the denominators are never (too close to) zero, and the model is 
in this sense well-posed. Another advantage of this formulation is that each ingredient 
of the model, especially the circumvention behavior, appears explicitly in the equations, 
thus facilitating the realistic interpretation of the contribution of each component of the 
model. For example, the exponent —4 of \\ud — Mell in the circumvention force (which is 
large compared to the exponent —2 in the attracting force) shows that the circumvention 
maneuver will only be effective when both agents are sufficiently close to each other. 

With this formulation, the control parameter K{t) appears as a factor of the perpen¬ 
dicular component of the force acting on the evader, and, by the simple choice of a value 
in { — 1, 0,1}, practically determines the behavior of the system. 


2.2 Two operating modes, a single control parameter 

Let us dehne the instantaneous distance between agents r(t) = \\ud(t) — 'We(^)||- 

Then, omitting time-dependence to lighten notation, Eqs. (jSD^dll) can be rewritten as 


CE 


Vd = 


f = 


D '^d 


Up 


rud r" 

^d 


1 + 4(S|'5J 


'^D 


K 


Gr {Ud - Ue)- 


irid 


Ed ^ 
— Vd, 
rrid 


rrip 


Ee ^ 

- Ve- 

rrip 


( 6 ) 

(7) 


We have solved the system ([I])-([5]) numerically with several classical methods (Runge- 
Kutta, adaptive or not, Crank-Nicolson), finding that a simple explicit Euler method 
with time-step of order 10“® is sufficiently effective and provides a sufficiently accurate 
solution for our purposes. Our results are based on exhaustive numerical simulations for 
a wide range of the parameters preserving the signihcance of the model. The behavior 
of the model is stable and coherent under reasonable variations of the parameters. The 
results we describe here are not dataset dependent. The values we have chosen are those 
providing the more illustrative hgures. 

Eq. ([H]) shows that a necessary condition to have an effective short-range repulsion 
acting on the driver is therefore that Gr(5^ — C^5‘l is negative, as it is the case for the 
dataset considered in our study. Similarly, in order to have a faster and more reactive 
behavior of the driver, we assume that and that rud and Ud are sufficiently 

smaller than rUe and Up respectively. 
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For the numerical simulations and graphical descriptions presented here, we have con¬ 
sidered the following dataset for the typical values of the constant parameters of the 
model: hrst, rrii and z/j are the mass and the friction coefficients of agent i respectively, 
with rud = 0.4, rrie = 1, = 1 and Ue = 2; then, = 3 is the coefficient of the 

attractive-repulsive force that the evader exerts on the driver, = 2 is the coefficient 
of the repulsive force that the driver exerts on the evader, and Cr = 0.5 is the coefficient 
of the circumvention component of the force exerted on the driver when k = ±1. Finally, 
the critical distances are as follows: Sc = 2, Si = 2 and S 2 = 2. 

Parameter values are dimensionless (of order one) and chosen so that their relative 
value allow the system to reproduce the behavior of a realistic driving phenomenon. 

The numerical simulations allow us to establish that the system has two operating 
modes, depending on the value of k: 

• Pursuit mode {k = 0): the driver D pursues the evader E, which moves away from 
D. Both agents tend to move in the same direction, given by their acceleration 
vectors, along the straight line DE. If k = 0 continuously during a sufficiently long 
interval of time, both agents’ velocities converge asymptotically to the same vector 
of norm Uas (due to the friction), and the driver stays at a constant distance from 
the evader, 5as- See Appendix A for an analytical estimate of these values, and 
the left panels in Fig. [1] the lower panel shows that, before reaching the constant 
value Vas ~ 0.71, the pursuer is faster than the evader. For the values we have used, 
Sas = \/2. 

• Circumvention mode {k = ±1): the driver D separates from the straight line DE 
and starts a circumvention maneuver around the evader E (clockwise or counter¬ 
clockwise, depending on the sign of k). The response of the evader is to move away 
from the driver. If k = 1 continuously during a sufficiently long interval of time, 
as the driver is faster than the evader, the asymptotic behavior of the system is a 
circular motion of the evader around a hxed point and a circular motion of the driver 
around the circle described by the evader. See Fig. [H where the right panels show 
that, for t G [0, 80], the system tends to a periodic conhguration where both agents 
are separated by a constant distance 5“® ~ 1.82 (not shown in the hgure) and have 
the same angular velocity of norm u^s ~ 7r/4 (the period s of the oscillations is 
s = 27i/ojas', bottom-right panel of Fig. [1] shows that s ~ 8). 

When the driver is sufficiently close to the evader, the circumvention mode is effective 
and triggers the circular behavior of the agents. See the right panels of Fig. [T], where the 
driver initial position at (—6, 0) is so far from the initial position of the evader (6, 0) that, 
although the control is set to k = 1 from to = 0 to tf = 100, the first part of the trajectory 
is almost a straight line. Until t ^ 32, both agents are almost still in the horizontal axis: 
Ud = (5, —0.09), Ue = (9.7, 0.02). At t ~ 37, Ud = (8.62, —0.37) and Ue = (11.05, 0.1), the 
driver is close enough to the evader and the circular behavior becomes perceptible. See 
the oscillations of u^(t) and ul(t), of period s ^ 40/5 = 8, in the right-bottom panel of 
Fig.E 
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Figure 1: The two operating modes of the system. Left panels: pursuit mode {k = 0). 
Right panels: circumvention mode (k = 1). Initial conhguration, in both cases: Ud = 
(—6,0), Ue = (6,0) and ut = (1,1), with zero initial velocities. Upper panels: agents’ 
trajectories. Arrows denote the direction of motion of the agents. For sufficiently long 
times, the velocities converge asymptotically to a constant value Vg- When k = 1, the 
driver turns counterclockwise around the evader, which also turns counterclockwise. Left- 
bottom panel: time variation of the norm of the velocities 1117^,6(^)11, both reaching v^s ~ 
0.71 at t ~ 38.9. Right-bottom panel: time variation of agents’ ordinates: M^(t) < 0.1 
until t ^ 32, while M|(t) < 0.1 until t ^ 37. For t > 40, periodic behavior (of period 
s ~ 8) with constant angular velocity Was ~ vr/4 and constant separation 5“® 1.82. 


Remark 1: When the driver is far from the evader (he., r ^ 1), the term between 
brackets in Eq. ([6]) is such that 1 S> l/r^, so the first term of this equation (which has 
coefficient C^) is of order while the term with nit) is of order 0(r“^). Then, 

Eq. ([6]) can be reduced to 


CEud- Ue l^d ^ 

Vd = - 5 - Vd- 

irid nid 


( 8 ) 


Remark 2: In particular, it can be observed that, when ^ ^ 2 , then 


CjlSf Ud - Ue 


Cr5% {Ud - Ue)^ 

rud 

rudr 

rrid 


CR 6 f 62 

d ’ 

rridr^ 


(9) 


SO that the term deviating the driver from the pure pursuit trajectory is negligible with 
respect to the term corresponding to the attracting force exerted by the evader. Therefore, 
when the driver is sufficiently far from the evader, the value of K{t) has no influence on 
the behavior of the driver, meaning that k can be set to zero. 




























2.3 On the controllability of the system 

The circumvention mode can be viewed as the active state of the system, where the control 
parameter k is set to ON, while the pursuit mode is the rest state of the system, where 
K is set to OFF. With the appropriate combination of both modes, the driver is able to 
make the evader reach any given target point or move along any (relatively smooth) given 
trajectory. The resulting behavior of such combination is what we call a driving behavior. 
See Fig. [2J 



Figure 2: Agents’ trajectories (driver: small black circles; evader: large red circles) for an 
admissible control function K{t). Left: whole trajectories, right: zoom for u^{t) G [0,15]. 
Symbols are equidistant in time; filled symbols labeled with the same number denote same 
instant of time (evader’s labels are shown in parentheses). 

This strongly suggests that the system can be controlled with a single control parameter, 
K{t). In control theory, a system is said to be fully controllable when, starting from any 
arbitrary initial state, every possible state of the system can be reached by appropriately 
adjusting the control parameters (see, e.g., [33]). In this sense, our numerical simulations 
have shown that agents cannot be separated an arbitrarily long distance, so that the 
driver-evader system cannot be labelled as fully controllable. 

In fact, we prove analytically in Appendix A that, for any initial state, the separation 
between agents is bounded by above and tends to a distance (of order 1), showing that the 
evader cannot escape from the driver to infinity, and the driver cannot move away from 
the evader to inhnity. Moreover, numerical simulations show that agents’ velocities are 
also bounded and that, when K{t) remains unchanged for sufficiently long time intervals, 
both velocities tend asymptotically to the same constant value {vas or (Uas). Finally, agents 
are prevented from occupying the same place at the same time (he., Ud(t) = Ue(t)) by the 
strong short-range repulsion force that the driver exerts on the evader, as our numerical 
simulations confirm. 

We will say instead that the driver-evader system is partially controllable, in the sense 
that each agent can be controlled separately: the driver can force the evader to reach any 
point in the plane (this is shown in the successive sections), and vice-versa, a series of 
targets for the evader can be selected so that the driver is driven to reach any point in 
the plane. 

The question arises now as how the driver-evader system behavior can be optimized 
to minimize a given cost functional accounting for the use of the lateral propellers. 


9 








3 Optimal control: two optimal open-loop controls, 
one feedback control law 


Denote by Bp{T) a ball of radius p centered in T, assume that the evader is initially far 
from the driver {e.g., r(to) = ||'Ud(to) — We(^o)|| <^i), and assume also that D, E and the 

ball Bp{T) are not aligned, that is, DE fl Bp{T) = 0. 

Consider now the objective of driving the evader E into the target ball Bp{T) at a 
hnal time t/ by controlling the driver D with an appropriate strategy K{t). As D, E and 
Bp{T) are not aligned, the circumvention mode has to be activated and/or has to remain 
active a suitable time to modify the trajectory of the evader and guide it towards the 
target ball Bp(T). 

Such an objective must appraise the cost of 1) forcing the system to leave its resting 
state, and/or 2) keeping the system in an active state. This cost is given by the functional 


J{k) aiNig{K) + (J2C(k), 


( 10 ) 


where is the number of times that the system is forced to leave its resting state 

in the time interval [to,tf] (that is, the number of ignition processes, where K{t) changes 
from 0 to ±1), and C(k) is the total time during which the system is active in [to, tf] (he., 
the time spent with a lateral propeller in active mode, where K{t) has a non zero value), 

r^f 

C{k) = / \K{t)\dt, (11) 

Jto 

and (Ji 2 are nonnegative weights hxed to balance the contribution of each partial cost. 
An optimal control problem (OCP) can then be formulated as 


(OCP) 


Min J(fi;) = aiNig^n) + a 2 C{K,) 

^ £ Ua.d = : [0)^/] {“1)0)1} such that Ue{tf) G Bp{T)'^ ’ 


( 12 ) 


where U^d is the set of admissible controls. 

Here we solve (OCP) for Nig = 0 and Nig = 1 in sections 3.1 and 3.2 respectively, 
hnding the corresponding optimal open-loop controls. Our results show that the system 
is highly sensitive to small variations of the conditions of the problem, so that a general 
open-loop control for Nig > 1 would not be of practical interest. Instead, we provide in 
section 3.3 a feedback control law for Nig > 1 that substantially reduces the cost of the 
open-loop controls for Nig < 1, and preserves the number of ignitions at a relatively low 
value (A^ig = 4; see later). 


3.1 Optimal open-loop guidance strategies for 

Solving (OCP) for Nig = 0 consists in minimizing C(k,) for K,(to) = kq, with kq = ±1. 
Indeed, if K,(to) = 0, then k ^ U^Ai because D and E are initially not aligned with the 
target ball and no inition process can be used to modify the trajectory of the evader. 
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so Ue(t) ^ Bp{T) for all t G [to, + 00 ). Similarly, solving {OCP) for = 1 consists in 
minimizing C{k) for K{t) of the form K{t) = Kq in a time interval [toN,toFF] and K{t) = 0 
outside. 

We thus seek two control function prohles like those sketched in Fig. [3l 
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- (b) 
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Figure 3: Prohles of the control function K{t) with kq = 1 for each case (a) Wg = 0 and 
(b) ]V„ = 1, 


3.1.1 iVij = 0: 

The following result holds: 

Let Krit)-. M —)■ {—1,0,1} be the following step-function in the time interval [to,tf], 


Hrit) 


kq a t < T 

0 if t > r ’ 


(13) 


where kq = ±1 is the initial value at time to- ^r(^o) = ^o- Then, if tj is sufficiently large, 
there exists an interval [Ta,Ti^] C (to,t/) such that, for all r G [Ta,Ti^], there exists a time 
t G (to,tj) for which the evader is in the interior of the ball of radius r centered in the 
target T. That is: 


V r G [Ta,Ti^], 3 t G (to,t/) such that ||'Ue(t) — Ut\\ < r. (14) 

Moreover, if r ^ 0, then the interval [rQ,r^] shrinks to a single point t* such that there 
exists a time t G (to,tf) for which Ue(t) = ut- See r* in Fig. El^a). 

The proof is based on a continuity argument applied to a (numerical) shooting method. 

Fig. m shows the trajectories of the evader for different values of r. By a continuity 
argument, it is possible to hnd the values of Tq,, and r* with a simple shooting method 
based on comparing the direction of the velocity vector of the evader Vf, with respect to 
the direction towards the target when the control is switched off. See Appendix B for a 
more detailed description of the decision test of the shooting method. 

In this case, the cost of a strategy K-r(t) given by flT^ is C{Kr) = r for all r G [TayT^^], 
so that the optimal strategy is For a sufficiently small value of r (we used r = 10“"^ 

in Fig. Hj), the interval [tq, tJ\ colapses to the value r* and the cost of the optimal control 
is C{k*^) = T*. 

Noticeably, Fig. H] reveals that the system is highly sensitive to small variations of r 
around the optimal value r*: the same variation of r produces a larger variation of the 
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Figure 4: Shooting method for hnding the optimal value r* of the control function shown 
in Fig. [^a), for which \\ueit) — ut\\ < p for some t G (0,f/) (here we used p = 10“^). 
Filled (blue) square denotes target’s position ut = (1,1), empty (red) circle evader’s 
initial position Ue{0) = (6, 0); driver’s initial position 'Ud(O) = (—6, 0) is not depicted. Line 
with (red) circles denote the trajectory of the evader for the optimal value r* = 41.15. 
Wide (black) line denotes the accumulation circle around which the evader turns if the 
control is kept to one. Thin (black) lines denote trajectories of the evader for the following 
non-equispaced values of r: 34, 36, 37, 37.5, 38, 38.4, 38.7, 39, 39.3, 39.6, 39.9, 40.2, 40.4, 
40.6, 40.8, 41, 41.3, 41.6, 42 and 43. 



deviation of Ve with respect to a reference line {e.g., the horizontal line) when r is close 
to r* than when r is far from t*\ for Ar = 2, the angular variation 9 from r = 39.9 to 42 
is larger than 7r/2 (actually, 6^2 — ~ 1.62), and is 10 times smaller from r = 34 to 

36 (036-^34-0.16). 

3.1.2 Wg = 1: 

In the previous case, the optimal control is set to 1 in the whole interval [0,ra). The 
possibility of using one ignition process allows us to consider controls where the system is 
at rest while the driver is approaching the evader, taking advantage of the fact that when 
the driver is far from the evader the circumvention term has (practically) no influence on 
the trajectory of the driver (and therefore, on the behavior of the system); see Remark 2. 

This delay in the ignition process substantially reduces the cost by allowing the use of 
a control function profile like the one depicted in Fig. [3](b), defined by 

4:'W = I n° ‘7", (15) 

ion \ y Q elsewhere ^ ^ 

where ton and toFP are the instants of time in which the control is switched on (k = kq = 
±1) and off (k = 0) respectively. For these control functions, the cost is given by 



12 








so that the {OCP) problem is reduced to hnd the values of ton and topF minimizing topF—^ on- 

These values are found by means of two successive (numerical) shooting methods. Let 
us consider the extreme case where p = 0 (numerically we used p = 10“®), so that the 
solution of the shooting method described in the previous section for iVig = 0 is unique. 

The hrst shooting method consists in hnding, for each value of ton, the (unique) value 
ioFF(^oN) for which there exists a time t G such that Ue{t) = ut- This shooting 

method is the one presented in the previous case where N\g = 0. In fact, in that case, 
foN = 0. This procedure allows us to build a function in such a way that the 

function K|“7(t) is admissible: G t4d- 

Then, the second shooting method consists in hnding the value of foN for which the 
characteristic function of the interval [toN, ^off(^on)] minimizes 




Figure 5: Shooting method to hnd the optimal values and tgppTor which \\ue(t) — ut\\ < 
p for some t G (0,f/) (p = 10“®), for two diherent values of tf: solid (black) line, tf = 60; 
dashed (red) line, tf = 50. Left panel; cost function C(toN) = ^off(^on) — ^on, exhibiting a 
plateau starting at ~ 38.92 (arrow) and situated at C* ~ 3.52. Right panel: zoom of 
the vertical axis, revealing the nonlinear shape of the curve with a minimum at tgjj, closely 
followed by a local maximum at ton ~ 39.82, and a decreasing regime for tf sufficiently 
large. The relative amplitude of the nonlinearity is ~ 7 x 10“®. 

Fig. [5] shows the cost function C as a function of C(foN) = ^off(^on) — ^on- 

When tf is sufficiently large, the cost C(foM) tends to a constant value C* ~ 3.52 which 
is the time is takes to the driver to make the evader turn back towards the target. See 
Fig. [6], where two examples with diherent ignition times ton yield (approximately) the 
same cost C*. 

This value C* constitutes a substantial reduction of the optimal cost found for Wg = 0. 

In fact. Fig. [5] shows that there is a wide range of values of ton yielding a similar value 
of the cost, so that, for a tolerance larger than 10“^, the optimal control would not be 
unique. This would allow the system to accept another criteria or another restriction to 
determine the optimal strategy, which can be for instance to minimize the total time or 
the total distance travelled by the agents (and which would be equivalent to consider the 
cost of back-propellers). 

However, if the variations of C* of order 10“® are relevant, a closer analisys shows 
that the hrst linear decreasing range in Fig. [5l which lasts until t^^ ^ 38.9, is due to 
the fact that during this interval of time [0, t*] the driver is far from the evader, that is. 
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Figure 6: Agents’ trajectories for two ON-OFF controls yielding the same cost C* ~ 3.53 
in Fig. [5l Wide lines (short trajectories, black and red): ton = 42, tgpp = 45.5337; thin 
lines (long trajectories, blue and orange): ton = 50, tgpp = 53.5221. Solid lines denote the 
driver, dotted lines the evader. In the short trajectory the evader reaches the target at 
tf = 65, while it must go until = 81 in the long trajectory. In both cases the turning 
back maneuver takes the same time C* and has the same shape. Right panel: trajectories 
during the turning back maneuver overlap when shifted t = t — 5.64. 


f = 11 Wd — Well S> 1, so that the term with n in Eq. (|3]) can be neglected (it is of order 
with respect to the terms corresponding to the attracting force (of order 0 {r~^)). 
This means that switching on the control before fgj, doesn’t contribute effectively to reach 
the target and is a waste of resources. In fact, the optimal value of the cost is reached at 
fgj,: C(38.9) = 3.5228. When tf is so short that the turning back maneuver can not be 
completed, then tgpp(tQj[) = tf, so that the cost function C(fQN) decreases (again linearly 
with tgu) to zero (but of course the target is not reached). Note that, for all ton, the value 
^off(^qn) will exist provided tf is sufficiently large. 

Interestingly, C(foN) exhibits an abrupt nonlinearity of small amplitude at tgfj; see the 
right panel of Fig. |5l In a pure pursuit regime, the velocity of the agents both converge 
to a constant value Uas due to the friction with the ground. See the lower left panel 
in Fig. [U The nonlinearity is located precisely at the time where the evader reaches 
this constant velocity (uas ~ 0.71). The curve of C(foK) reaches a (local) maximum at 
a value of fan slightly larger than due to that the driver is close to the evader and 
both agents’ velocities have converged to v^s- For larger values of fou in the horizontal 
plateau of Fig. [5l the turning back maneuver is practically identical in space and time; 
see Fig. |6l especially the right panel, where we show that two different values of ton in the 
plateau yield two turning back maneuvers that practically overlap. The slight decrease 
of the plateau shown in the right Panel of Fig. [5] is due to the fact that turning back 
maneuvers are less consuming the farther from the target they takes place, because the 
angle at which the control is switched off is smaller. This situation is reversed (that is, 
the plateau increases) when the target point is above the turning back region -e.g., at 
{x,y) = ( 1 , 2 ). 

Although the relative amplitude of the nonlinearity of C(tQN) is very small (0.024 with 
respect to 3.52), it unexpectedly adds an important complexity to the study of the system: 
the minimum located at tg^ can be global or local depending on tf, therefore complicating 


14 












the search for optimal directions of descent in nnmerical minimization methods. 

Moreover, the high sensitivity of the system detected in the previous section is also in 

t* 

action if control functions like are used when small variations of fgpp with respect 

to ton can occur. 

This section shows that, despite a quite predictive general behavior of the system 
(no signs of chaos have been detected), its high sensitivity strongly suggests the use of 
closed-loop controls. 

3.2 A feedback control law for N[g > 1 

For systems that are subject to conditions of high sensitivity like those described in sec¬ 
tion 3.1, closed-loop or feedback controls offer the possibility of correcting instantaneously 
the state of the system for deviations from the desired behavior [311132] . Moreover, the 
explicit form of the control as a function of time need not to be known a priori in the 
whole time interval [tQ,tf]. In turn, feedback laws have to pay the cost of continuously 
monitoring the position and velocity of the agents. 

We present here a feedback control law based on the following observations: 

1. In real situations, the orientation of the vector Ve{t) used in the shooting method 
can be difficult to observe with the accuracy required by the high sensitivity of the 
system. 

Instead, the alignment a{t) of the driver D and the evader E with the target point 
T is easier to observe and is a good approximation of Ve(t). 

2. When the driver is sufficiently far from the evader, K{t) can be set to zero (Remarks 1 

& 2 ). 

The instantaneous information about the state of the system is processed in real 
time to determine the distance separating both agents r{t) and the alignment a{t). The 
alignment a{t) can be charaterized by the following scalar product (time dependence is 
omitted to lighten notation): 

a(t) = {ut - Ud) ■ {Ue - Ud)^ = (ut - (17) 

The sign of a{t) reveals in which half-plane the target T is with respect to the line DE, 
and can be used to determine the sign of K{t). 

Moreover, |a(f)| is an instantaneous measure of how urgently the control must be set 
to ON. Let us consider a maximal tolerance of deviation a. The feedback control law is 
based on the idea that when |a(t)| is smaller than a, it is possible to consider that T is 
practically on the line DE, so that nit) can be set to OFF, thus saving cost, and when 
|a(t)| > a, the deviation is excessive and the control must be set to ON. The tolerance of 
deviation a is an effective bound for both the angle and the intensities of the velocities 
(a = \\ut — Ud\\ \\ue — Ud\\ cos{ut — Ud,Ue — Ud)^), SO |a| < a restricts also the velocities 
of the agents: a slightly deviated evader at a high speed can miss the target as well as a 
largely deviated evader at a lower speed. 
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Note also that when |a(t)| < a, the control can be switched off provided the evader 
and the target are at the same side with respect to the driver, in order to prevent the 
driver from driving the evader away from the target; that is, K{t) can be set to zero only 
if the scalar product {ug — ut) ■ {ug — Ud) is negative. 

Finally, Remark 2 is introduced into the feedback law by means of the characteristic 
function 


X{t) = 


0 if r^{t) > (52, 

1 if not. 


(18) 


which serves to switch off the control when the driver is far (r^(t) ^ ^ 2 ) from the evader. 
The feedback control law can then be written as follows: 


K^it) = Xit) X 


0 if \a{t)\ < a and {ug — ut) ■ {ug — Ud) < 0, 

sign{a(t)} if |a(t)| > a or {ug — ut) ■ {ug — Ud) > 0. 


(19) 


We have solved the system ([I])-([5|) numerically using the condition r^{t) > 3(52/2 to 
have X{t) =0 in expression ([19]). We have considered an alignment tolerance a = 4x 10“^. 
The rest of values are as in previous sections. Let us refer to this case as case (a). 

The result is that the feedback law reduces substantially the cost obtained with the 
open-loop control in Sec. 3.1: = 1.43, so an improvement of 60% with respect to 

C* = 3.53, with however a slight increase of the number of ignition processes, from iV,* = 1 

to ^ = 4. 

See Figs. [7l[9] and [H](al),(a2). 



Figure 7: Agents’ trajectories for the feedback law kf(^) given in flT^ . with alignment 
tolerance a = 4 x 10“^. Left, whole trajectories; right, detail. Wide solid segments denote 
intervals of time with active control (ON). Small solid circles correspond to onsets and 
ends of such intervals. Numbers denote segments of the same interval of time: Si = 
[39.17,39.55], S 2 = [41.54,41.89], Rg = [43.77,44.11] and R 4 = [45.98,46.32]. The cost is 
4'"^ = 1.43. 


Figs. [7| shows the trajectories of the agents (the whole trajectories in the left panel, 
and a zoom of u^{t) G [0,15] in the right panel). Surprisingly, the trajectory of the driver 
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exhibits an oscillatory behavior around the circular trajectory of the evader, allowing 
the driver to remain closer to the evader than in the previous cases (see, e.g., Fig. Ej). 
Moreover, the time spent with the control in active mode {kf = 1) is surprisingly short 
compared to the time spent in this state in the open-loop control 



0 10 20 30 40 50 60 0 10 20 30 40 50 60 0 10 20 30 40 50 60 



38 40 42 44 46 48 38 42 46 50 54 58 38 42 46 50 54 58 62 


t t t 

Figure 8: Feedback control laws resulting from the three previous situations denoted 
(a), (b) and (c) in Figs. [7] and [HI showing the extreme sensitivity of the system under 
variations of the alignment a* and the switching times of the control K{t). Lower panels 
(abc2): zoom of upper panels (abcl). Resulting costs: = 1.43, = 4.275 and 

Cp'^^ = 4.13. Number of ignition processes are = 4, = 43 and = 16. 

The improvement consists in that the circumvention mode is interrupted (kf = 0) 
during the surrounding motion of the driver around the evader, reducing the time spent 
with Kp = 1 to four small intervals Sj, j = 1,...,4, of total length Cp*^^ = 1.43, and, 

fa) 

accordingly, i\lg = 4. See the wide solid segments in the trajectories of the agents 
in Fig. [7] and Figs |8](al) and (a2), which show the resulting control function HFif) for 
t G [to,tf], with to = 0 and tj = 63. 

The solution found in Fig. [7] with the feedback law flT^ can indeed be considered a 
good solution of the control problem. However, this is not a general situation, as shown 
by the wide range of cases analysed in our numerical simulations, because of the high 
sensitivity exhibited by the system. Let us illustrate this observation here by showing the 
results for two slightly different external conditions; see the cases (b) and (c) in Figs. |H] 

andO 

Case (b) and (c) use an alignment tolerance a = 10“^. Fig. IH] shows that the evader 
follows almost the same trajectory than in case (a) (depicted in the figure to facilitate 
the comparison) and reaches the target with a more accurate orientation. However, such 
a small deviation requires an enormous increase of the use of the lateral propellers, as 
shown in Fig. |S](b2). Not only the cost of having the control set to 1 is larger, Cp ^ = 4.275 
(and larger than with the open-loop controls), but a much greater number of ignition 
processes is required = 43), moreover involving both the right and the left propellers 
alternatively (he., Kp = +1, 0, —1, 0, -|-1 ,... ). 
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Figure 9: Perturbations of the previous case depicted in Fig. [71 (a) perturbation of 
the alignment tolerance: here a = 10“^ instead of 4 x 10“^, and (b) perturbation of 
relative order 10“^ of the hrst interval of activation: here S[ = [39.17,39.6] instead of 
Si = [39.17,39.55]. For t > 39.6 the same feedback law flT^ is used. In (c) the target 
is not reached because t/ is too small; for a larger fj, the target is reached with no 
additional cost (kf = 0 during this extra-time). Dashed lines represent the solution of 
case (a), depicted here for comparison. 


On the other hand, case (c) shows that small variations of the switching times of the 
feedback control can produce huge qualitative differences in the behavior of the agents. We 
have introduced a small perturbation of the first interval of activation Si = [39.17, 39.55], 
by keeping k = 1 until 39.6 instead of until 39.55 (a perturbation of relative size 1 — 
39.55/39.6 = 1.3 x 10“^). For t > 39.6, we use again the feedback law flT^ . so that a 
different control function profile arises. The resulting behavior of the agents is depicted 
in Fig. [ni[c). 

Fig. [9][c) shows that after the perturbation, the agents describe a widely deviated 
trajectory, especially in the case of the driver, with respect to the one described with the 
unperturbed feedback law shown in Fig. [71 Accordingly, the perturbed control function 
prohle, depicted in Fig. [H][c), is signihcantly different from the unperturbed feedback law. 
The driver requires more time to reach the target {t'j = tf + 1 = 64) than in cases (a) and 

(b). Compared with case (b), the cost is higher, although not excessively, Cp ^ = 4.13 (with 
no additional cost for the extra time because the evader moves in the right direction), 
and the number of ignition process is much smaller: = 16. 

Perturbations of the rest of switching times produce similar results, that is, qualitative 
deviations of the trajectories and a larger number of ignition processes, with respect to 
the unperturbed case (a), meaning that, in cases where delays can appear in the instants 
of time in which the control has to be manipulated, important changes of the behavior of 
the system may arise. 

The oscillatory behavior arisen in case (c) can appear to be less convenient but, ac¬ 
cording to our numerical simulations, it is not necessarily worse than the smooth evader¬ 
following trajectory; oscillations can contribute to reduce the number of ignition processes, 
with a low increase of the cost; see left panels (a) in Fig. [TUJ 

The feedback control law is able to drive the evader along a given trajectory, provided 
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Figure 10: Left panels: (al) agents’ trajectories and (a2) feedback control producing an 
oscillatory behavior of the driver and a surprisingly low cost C ~ 4.1. Panel (b): agent’s 
trajectories in a case where the path to follow is a series of 7 targets randomly distributed 
in a radius smaller than 8 from the original location of the evader. Note that for some 
targets the trajectory is smooth (Ti, T 2 , T 3 , T 5 and T 7 ) but for some others oscillations 
are necessary to adjust the alignment (T4 and Tg). Panel (c): evader’s trajectory along a 
sinusoidal path described by a large number of closely spaced targets. 


the trajectory is sufficiently smooth, that is, the trajectory can be described by a series 
of targets; see Fig. [TUlf bl for a series of random targets, and (c) for a sinusoidal trajectory 
described by a large number of closely spaced targets. The study of the behavior of the 
system when the path to follow has a very high curvature or describes very acute corners 
is matter for future work. 


4 Conclusion 

We have presented an agent based model for the guidance by repulsion problem, consisting 
of a system of equations corresponding to Newton’s second law. The system can adopt 
two operating modes that can be controlled by a single parameter K{t)\ for k = 0 , the 
system is in the pure pursuit mode (control in mode OFF, i.e., resting state), and for k = 1, 
the system is in the circumvention mode (control in mode ON, i.e., active state). We have 
shown that by appropriately dehning the function K.(t) : [to,tf] {— 1 , 0 , 1 }, the driver 
can guide the evader to any desired target or along any (relatively smooth) path. We 
have then formulated an optimal control problem (OCP) to hnd the optimal guidance 
strategy minimizing the cost in terms of the number of times the system is activated 
from its resting state and the time the system spends in the active mode. By means of 
(numerical) shooting methods, we have obtained the optimal open-loop strategies for the 
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case where the number of activations is equal or smaller than one, finding that the system 
is highly sensitive to small variations of the act i vat ion/deactivation times. 

These results show that open-loop controls would not be of practical interest in real 
problems due to the presence of external perturbations, thus suggesting the use of a 
feedback law. Taking advantage of the information provided in the study of the behavior 
of the system under open-loop controls for Nig < 1, we have designed a feedback law for 
the case where Nig > 1, which allows to correct in real time for deviations from the desired 
trajectory. We have found that the feedback law is also highly sensitive to small variations 
of the conditions of the problem, in this case, of the accuracy with which the target is 
reached (he., the radius of the target ball). Moreover, our results show that the feedback 
law, and therefore the resulting behavior of the system, are highly sensitive to possible 
delays in the switching times of the control. This means that, in systems or devices where 
the manipulation of the control cannot be carried out at arbitrarily close instants of time, 
the behavior of the driver can exhibit large oscillations that can produce an increase of the 
cost. This may happen in situations where time delays exist in collecting and interpreting 
the data about the state of the system or in the reaction time of the system once the 
control is changed, especially when two consecutive changes are very close in time. 

The main direction for the immediate future work consists in taking into account the 
cost for the driver to get close to the evader, both in time and travelled distance. This 
would correspond to add the cost of back propellers to the cost functional: 

JB{tf)=r]i[ \\ud{t)-Ud{toWdt + ri2tf, ( 20 ) 

Jto 

and hud a feedback control law for the two controls K{t) and tf. 

The interest of guidance by repulsion could also be extended to the case where the 
evader’s behavior has a stochastic component and when multiple agents (evaders and/or 
drivers) are considered. The feedback law will be especially relevant when noise is con¬ 
sidered in both the behavior of the agents and in the manipulation of the data. 
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A Asymptotic value of the velocities at long times 
when = 0 


Numerical simulations show that when K{t) = 0 continuously for a sufficiently large time, 
both agent’s velocities converge asymptotically to the same constant velocity Usat- In that 
state, Vd(t) = Veit) = 0 and, from Eqs. (l6])-(I7j), we have 


C£) '^d Ue 


52 






/^E 
'^D 


Ce Ud - Ue 




( 21 ) 


Thus, comparing norms, we obtain Uas = /(i^eSas), and extracting S^s, we obtain 


5 


as 


/ 


•'ejCES^, - CrSI) 


( 22 ) 


provided z/gC'^ > i^dC^, as it is the case for the values we are considering. Note also that 
a necessary condition to have an effective short-range repulsion acting on the driver is 
that the factor between large parentheses in Eq. ([6]) is positive, so — CR(5f > 0 and 
the radicand in expression (1^ is positive. 


B On the controllability of the driver-evader system 

We show here that driver and evader are prevented from separating inhnitely from each 
other and tend asymptotically to be separated a distance of order one. The proof follows 
the idea of “free agents” used in |2l] (see also iza for a more similar model). 

Definition A driver agent D is said to be a free agent at time t if its distance to the 
evader E is greater than an arbitrarily large positive constant 5 3> 1; that is, rit) = 

\\ud{t) - Ue{t)\\ > s. 

Proposition 1. If the driver D is a free agent, then the system ean be reduced 

as follows: 


Ud{t) = Vd{t), 


Ue{t) 

Vd{t) 

Ve{t) 


Ve{t), 

Cp Udit) - Ue{t) _ 
rUd \\ud{t) - Ue{t)P md^"^ 
<gg Ue{t) - Ud{t) _ 
rOe \\Ueit) - Ud{t)\\^ me^"" 


(23) 

(24) 

(25) 

(26) 


Proof. Using Remark 1. 


□ 
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Lemma 2. Ifrrid/i'd < then, Vt > 0, 


\mf < 

^£) ''^e 


< 


cEi^d 

civ. 


WMtW 


(27) 


Definition Let q = {ud,Vd,Ue,Ve) £ and i7(q): M® —)■ M be the following potential 
functional: 

V^(q) = H\\Ud - UeW) + \\Vdf - \\Vef ■ (28) 

Then, for free agents (he., r{t) > 5) and under the hypotheses of Lemma [2], lL(q) is 
positive. 


Theorem 3. If the driver D is a free agent, then lh(q) is bounded from below and lh(q) 
is negative along the agents’ trajectories defined by the system Conseguently, 

l/(q(t)) converges in time to a minimum which is reached when the distance between both 
agents is S. 


Proof. The time-derivative of V (q) along the agents’ trajectories is given by: 

tV/ N ^ dud „ dvd „ du, „ dv. 


where 

Then: 


= 


dt 

I'^d ^ell 


dt 


md 




dt 

and 


dt 




m. 


eg 


V.. 


(29) 

(30) 


r(q) = 




u. 


\\Ud ^e|| 

Ud U, 


2 ' '^d + '^E'^d 

m, ^ 


\\Ud-Uer "" cl 

^d Ur- i |2 


= --^iWdr+ -p^wve 

'^E 


CE 


D '^d 


Up 


md \\Ud - Well" 
CUd U, 
‘Va, \\Ud ^e|| 


Ed 

md 

. ffp 

m. 


-Vd 


(31) 

(32) 

(33) 


which, under the conditions of Lemma [2l is negative. Then V (q) decreases and is bounded 
from below, so ld(q) has a minimum, which is reached when ||hd(t)|| = 1114(^)11 = 0 and 
r{t) = \\ud{t) — Ue{t)\\ = S (which is the minimum value of r{f) for a free agent). □ 

Thus, agents are prevented from separating infinitely from each other because as soon 
as r{t) > 6, the driver becomes a free agent and is forced to move back towards the evader, 
provided the balance between the mass and the friction of the agents verifies Lemma [2J 
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C Simple shooting method 

Once the evader is turning back towards the target, that is, v^(t) < 0, we check the 
direction of the velocity vector of the evader with the line ET described by the evader 
and the target. Then, if for some time t G (to,tf) the vector Ve{t) points towards a point 
located below T, the tentative value of fopp must be reduced; if instead, at the hnal time 
tf, the vector Ve{t) points towards a point located above T, then the tentative value of 
toFP must be augmented. 

That is: Given a value of ^qn, take an initial value of fopp larger than tou and: 

0. Solve the system ([I])-® with for t G (toG/) and anotate the value of tf,, 

which is the hrst time such that < 0. If no such time is reached, this means 

that foFF < tf,, so take a larger value of fopF and shoot again (goto 0). 

1. For each time t G evaluate the instantaneous alignment a{t) of the velocity 

vector of the evader Veit) with respect to the target point ut'- 

a(t) = (u^ - - (u^ - (34) 

Then, if a(t) < 0, take a smaller value of fopp and shoot again (goto 0). 

2. If at time tj the velocity vector of the evader is still pointing above the target, that 
is, a{tf) > 0, then take a larger value of fopp and shoot again (goto 0). 

The new value of fopp for the next shoot can be selected with a simple method {e.g., 
bisection). 

Stop when |a(t)| < e, for a small value of the tolerance e; the value of topp(^oN) has been 
found, proceed to the next value of ton. 
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